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In the Claims 

1. (Currently Amended) A processor-readable medium comprising 
processor-executable instructions for personalizing karaoke, the processor- 
executable instructions comprising instructions for: 

segmenting visual content to produce a plurality of sub-shots; 

segmenting music to produce a plurality of music sub-clips , wherein the 
segmenting establishes boundaries between the music sub-clips at beat positions 
within the music ; and 

displaying at least some of the plurality of sub-shots as a background to 
lyrics associated with the plurality of music sub-clips. 

2. (Original) The processor-readable medium as recited in claim 1 } 
additionally comprising instructions for: 

shortening some of the plurality of sub-shots to a length of a corresponding 
music sub-clip from within the plurality of music sub-clips. 

3. (Original) The processor-readable medium as recited in claim 1, 
wherein segmenting the visual content comprises instructions for: 

dividing a shot into two sub-shots at a maximum peak of a frame difference 
curve; and 

repeating the dividing to result in sub-shots shorter than a maximum sub- 
shot length. 
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4. (Original) The processor-readable medium as recited in claim 1, 
additionally comprising instructions for: 

filtering the plurality of sub-shots according to importance; and 
filtering the plurality of sub-shots according to quality. 

5. (Currently Amended) The processor-readable medium as recited in 
claim 4, wherein filtering the plurality of sub-shots according to quality comprises 
instructions for: 

examining color entropy within each of the plurality of sub-shots for 
indications of diffusion of color; and 

if color entropy is low, analyzing each of the plurality of sub-shots to detect 
motion more that than a threshold indicating interest and less than a threshold 
indicating low camera and/or object movement; 

selecting sub-shots having acceptable motion and/or color entropy scores. 

6. (Original) The processor-readable medium as recited in claim 4, 
wherein filtering the plurality of sub-shots according to importance comprises 
instructions for: 

evaluating frames within a sub-shot according to attention indices; and 
averaging the attention indices for the frames to determine if the sub-shot 
should be included or excluded. 
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1 7. (Original) The processor-readable medium as recited in claim 4, 

2 wherein filtering the sub-shots according to importance comprises instructions for: 

3 analyzing for camera motion, for object motion and for specific objects 

4 within the sub-shots; 

5 filtering the sub-shots according to the analysis. 

6 

7 8. (Original) The processor-readable medium as recited in claim 1, 

wherein the instructions for segmenting visual content segment video. 

9. (Original) The processor-readable medium as recited in claim 8, 
additionally comprising instructions for: 

selecting important sub-shots from within the plurality of sub-shots; and 
selecting sub-shots such that they are uniformly distributed within the 

video. 

10. (Original) The processor-readable medium as recited in claim 9, 
wherein selecting important sub-shots comprises instructions for: 

evaluating color entropy, camera motion, object motion and object 
detection; and 

selecting the important sub-shots based on the evaluation. 
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11. (Original) The processor-readable medium as recited in claim 9, 
wherein selecting uniformly distributed sub-shots comprises instructions for: 

evaluating normalized entropy of the sub-shots along a time line of video 
from which the sub-shots were obtained. 

12. (Original) The processor-readable medium as recited in claim 1, 
wherein the instructions for segmenting visual content includes instructions for 
assigning photographs to be sub-shots. 

13. (Original) The processor-readable medium as recited in claim 12, 
wherein the instructions for assigning photographs includes instructions for: 

rejecting photographs having problems with quality; and 
rejecting photographs within a group of very similar photographs wherein a 
photo within the group has been selected. 

14. (Original) The processor-readable medium as recited in claim 12, 
wherein the instructions for assigning photographs includes instructions for: 

converting at least one of the photographs to video. 

15. (Original) The processor-readable medium as recited in claim 1, 
wherein the visual content comprises home video and photographs in digital 
formats. 

16. (Cancel) 
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1 17. (Original) The processor-readable medium as recited in claim 1, 

2 wherein segmenting music into the plurality of music sub-clips comprises 

3 instructions for bounding music sub-clip length according to: 

4 minimum length = min{max{2* tempo,2},4} and 

5 maximum length = minimum + 2. 

6 

7 18. (Original) The processor-readable medium as recited in claim 1, 

s wherein segmenting the music comprises instructions for: 

9 establishing music sub-clips' length within a range of 3 to 5 seconds. 

10 

11 19. (Original) The processor-readable medium as recited in claim 18, 

12 wherein segmenting the music comprises instructions for: 

13 establishing boundaries for the music sub-clips at sentence breaks. 

15 20. (Original) The processor-readable medium as recited in claim 1, 

16 additionally comprising instructions for: 

17 obtaining the lyrics from a file; and 

is coordinating delivery of the lyrics with the music using timing information 

19 contained within the file. 

20 

21 21. (Original) A processor-readable medium as recited in claim 20, 

22 wherein obtaining the lyrics comprises instructions for sending the file over a 

23 network to a karaoke device as a part of a pay-for-play service. 



7 



S/N 10/723,049 



Response to Office Action Dated 04/06/2007 



22. (Original) The processor-readable medium as recited in claim 1, 
additionally comprising instructions for: 

querying a database of songs by humming a portion of a desired song; and 
selecting the desired song from among a number of possibilities suggested 
by an interface to the database. 

23. (Currently Amended) A processor-readable medium comprising 
processor-executable instructions for providing integrating lyrics, for integration 
with— music and video content suitable for karaoke, the processor-executable 
instructions comprising instructions for: 

receiving a request for a file associated with a specified song, wherein the 

filer 

associates e ach syllable contained within the lyrics with timing 

valu e s; and 

associat e s each sentence contain e d within the comprises, music. 
lyrics , and with timing values associated with the lyrics ; and 

fulfilling the request for the file by sending the file associated with the 
specified song, song; 

segmenting the music to produce a plurality of music sub-clips, wherein the 
segmenting establishes boundaries between the music sub-clips at beat positions 
within the music; 

segmenting visual content to produce a plurality of sub-shots of a length 
corresponding music sub-clips from the plurality of music sub-clips; and 
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outputting the plurality of music sub-clips together with corresponding sub- 
shots of visual content, wherein the visual content is configured as a background 
to the lyrics associated with the music sub-clips. 

24. (Original) A processor-readable medium as recited in claim 23, 
wherein obtaining the lyrics comprises instructions for sending the file over a 
network to a karaoke device. 

25. (Currently Amended) A personalized karaoke device, comprising: 

a music analyzer configured to segment music to produce a plurality of 
music sub-clips, wherein the segmenting establis hes boundaries between the music 
sub-clips at beat positions within the music of a song creato music sub - clip s- of 
varying lengths according to a song ; 

a visual content analyzer configured to define and select visual content sub- 
shots; 

a lyric formatter configured to time delivery of syllables of lyrics of the 
song; and 

a composer configured to assemble the music-sub clips with the visual 
content sub-shots, and configured to adjust length of the sub-shots to correspond 
to the music sub-clips, and configured to superimpose the syllables of the lyrics of 
the song over the sub-shots. 
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26. (Original) The personalized karaoke device of claim 25, wherein the 
music analyzer is configured to segment the song with a strong onset between 
each of the music sub-clips. 

27. (Original) The personalized karaoke device of claim 25, wherein the 
music analyzer is configured to segment the song with a beat between each of the 
music sub-clips. 

28. (Original) The personalized karaoke device of claim 25, wherein the 
music analyzer is configured to segment the song automatically into sub-clips, 
each having a duration that is a function of song tempo. 

29. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to segment video into sub-shots. 

30. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to access folders of home video and 
photographs containing content from which the sub-shots are derived. 

31. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to assemble still photographs, each of which 
is a sub-shot. 
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32. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to select from among sub-shots according to 
ranked importance, wherein importance is gauged by detection of color entropy, 
detection of object motion within the sub-shot, detection of camera motion during 
the sub-shot, and/or detection of a face within the sub-shot. 

33. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to filter out sub-shots having low image 
quality as measured by low entropy and low motion intensity. 

34. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to select sub-shots of greater importance 
consistent with creating a uniform distribution of the sub-shots over a runtime of a 
source video. 

35. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to reject photographs of low quality by 
detecting over and under exposure, overly homogeneous images and blurred 
images. 

36. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to organize photographs by date of exposure 
and by scene, thereby obtaining photographs having a relationship. 
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37. (Currently Amended) The personalized karaoke device of ekim 
^? claim 36 , wherein the visual content analyzer is configured to reject 
photographs which are members within a group of very similar photographs, 
wherein one of the group has already been selected. 

38. (Original) The personalized karaoke device of claim 25, wherein the 
visual content analyzer is configured to: 

detect an attention area within a photograph; and 

create a photo to video sub-shot based on the attention area, wherein the 
video includes panning and/or zooming. 

39. (Original) The personalized karaoke device of claim 25, wherein the 
lyric formatter is configured to consume a file detailing timing of each syllable 
and each sentence of the lyrics. 

40. (Currently Amended) An apparatus, comprising: 

means for creating music sub-clips by segmenting the music to define 
boundaries between the music sub-clips at beat positions of varying l e ngths 



according to within a song; 



means for defining and selecting visual content sub-shots; 
means for timing delivery of syllables of lyrics of the song; and 
means for assembling the music sub-clips with the visual content sub-shots, 
and to adjust length of the sub-shots to correspond to length of the music sub- 
clips, and to superimpose the syllables of the lyrics of the song over the sub-shots. 
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41. (Original) The apparatus of claim 40, wherein the means for defining 
and selecting visual content sub-shots is a video analyzer configured to segment 
video into sub-shots. 

42. (Original) The apparatus of claim 40, wherein the means for defining 
and selecting visual content sub-shots is a video analyzer configured to access 
folders of home video and photographs containing content from which the sub- 
shots are derived. 

43. (Original) The apparatus of claim 40, wherein the means for defining 
and selecting visual content sub-shots is a video analyzer configured for: 

detecting an attention area within a photograph; and 
creating a photo to video sub-shot based on the attention area, wherein the 
video includes panning and zooming. 

44. (Original) The apparatus of claim 40, wherein the means for timing 
delivery of syllables of lyrics of the song is a lyric formatter configured for 
consuming a file detailing timing of each syllable and each sentence of the lyrics 
and for rendering the lyrics syllable by syllable. 
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